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Nowadays parallel object-relational DBMS are envisioned as the next great wave, but 
there is still a lack of efficient implementation concepts for some parts of the proposed 
functionality. Thus one of the current goals for parallel object-relational DBMS is to move 
towards higher performance. In this paper we develop a framework that allows user-defined 
functions to be processed with data parallelism. We will describe the class of partitionable 
functions that can be processed in parallel. We will ... 
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The ability to create user-defined aggregate functions (UDAs) is rapidly becoming a 
standard feature in relational database systems. Therefore, problems such as query 
optimization, query rewriting and view maintenance must take into account queries (or 
views) with UDAs. There is a wealth of research on these problems for queries with 
general aggregate functions. Unfortunately, there is a mismatch between the manner in 
which UDAs are created, and the information that the database system requires ... 

Keywords: aggregate queries, query optimization, query rewriting, view maintenance 
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New applications from the areas of analytical data processing and data integration require 
powerful features to condense and reconcile available data. As outlined in [1], the general 
concept of grouping and aggregation appears to be a fitting paradigm for a number of 
these issues, but in its common form of equality based groups or with current extensions 
like simple user-defined functions to derive group-by values on a per tuple basis and 
restricted aggregate functions a number of problems remai ... 
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The problem of rewriting aggregate queries using views is studied for conjunctive queries, 
with arbitrary aggregation functions and built-in predicates. Two types of queries over 
views are introduced for rewriting aggregate queries: pure candidates and aggregate 
candidates. Pure candidates can be used to rewrite arbitrary aggregate queries. 
Aggregate candidates can be used to rewrite queries containing aggregate functions 
definable in terms of a commutative-semigroup operation. For ... 
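The idea behind aggregate candidates can be illustrated with a small sketch. It is not the paper's rewriting algorithm; it only shows why an aggregate built from a commutative, associative operation (such as sum or max) can be recomputed from a view that already holds partial aggregates. The helper `aggregate`, the sample rows, and the column layout are hypothetical.

```python
from operator import add

def aggregate(rows, key, value, op):
    """Group rows by key(row) and fold value(row) with the binary operation op.

    op is assumed to be associative and commutative (a commutative semigroup),
    so the order in which values are combined does not matter.
    """
    out = {}
    for row in rows:
        k = key(row)
        v = value(row)
        out[k] = v if k not in out else op(out[k], v)
    return out

# Hypothetical base relation: (region, city, amount)
rows = [("EU", "Paris", 3), ("EU", "Rome", 4), ("EU", "Paris", 5), ("US", "NYC", 7)]

# A view that aggregates at the finer (region, city) granularity.
view = aggregate(rows, key=lambda r: (r[0], r[1]), value=lambda r: r[2], op=add)

# The coarser query (per region) answered from the view's partial results.
from_view = aggregate(view.items(), key=lambda kv: kv[0][0], value=lambda kv: kv[1], op=add)

# The same query answered directly from the base relation.
direct = aggregate(rows, key=lambda r: r[0], value=lambda r: r[2], op=add)
```

Replacing `add` with `max` works the same way, whereas an aggregate such as the median cannot be recombined from per-group results like this.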
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Full text available: pdf (1.57 MB) Additional Information: full citation, abstract, references, index terms 

In this paper we consider aggregate predicates and their support in database systems. 
Aggregate predicates are the predicate equivalent to aggregate functions in that they can 
be used to search for tuples that satisfy some aggregate property over a set of tuples (as 
opposed to simply computing an aggregate property over a set of tuples). The importance 
of aggregate predicates is exemplified by many modern applications that require ranked 
search, or top-k queries. Such queries are the norm ... 
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conference on Management of data SIGMOD '00, volume 29 issue 2 
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We investigate the problem of using materialized views to answer SQL queries. We focus 
on modern decision-support queries, which involve joins, arithmetic operations and other 
(possibly user-defined) functions, aggregation (often along multiple dimensions), and 
nested subqueries. Given the complexity of such queries, the vast amounts of data upon 
which they operate, and the requirement for interactive response times, the use of 
materialized views (MVs) of similar complexity is often mandatory ... 
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Research papers: estimation and approximation: Relational confidence bounds are 
easy with the bootstrap 
Abhijit Pol, Christopher Jermaine 

June 2005 Proceedings of the 2005 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: pdf (337.51 KB) Additional Information: full citation, abstract, references 

Statistical estimation and approximate query processing have become increasingly 
prevalent applications for database systems. However, approximation is usually of little 
use without some sort of guarantee on estimation accuracy, or "confidence bound." 
Analytically deriving probabilistic guarantees for database queries over sampled data is a 
daunting task, not suitable for the faint of heart, and certainly beyond the expertise of the 
typical database system end-user. This paper considers the prob ... 
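For readers unfamiliar with the bootstrap, the following is a minimal sketch of a generic percentile bootstrap over an in-memory sample. It is not the method of the paper above, whose abstract is truncated here; the function name `bootstrap_ci`, its parameters, and the sample data are assumptions made for illustration.

```python
import random

def bootstrap_ci(sample, estimator, n_resamples=1000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for estimator(sample).

    Resamples the data with replacement, re-evaluates the estimator on each
    resample, and returns the empirical alpha/2 and 1 - alpha/2 quantiles.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(sample)
    estimates = sorted(
        estimator([rng.choice(sample) for _ in range(n)])
        for _ in range(n_resamples)
    )
    lower = estimates[int((alpha / 2) * n_resamples)]
    upper = estimates[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper

def mean(xs):
    return sum(xs) / len(xs)

# Hypothetical sampled column values standing in for a sampled query result.
sample = list(range(100))
low, high = bootstrap_ci(sample, mean)
```

The appeal of this approach is that the user supplies only the estimator; no analytic variance formula is needed.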
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October 2001 Proceedings of the tenth international conference on Information and 
knowledge management 
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Scalable data mining in large databases is one of today's challenges to database 
technologies. Thus, substantial effort is dedicated to a tight coupling of database and data 
mining systems leading to database primitives supporting data mining tasks. In order to 
support a wide range of tasks and to be of general usage these primitives should be 
rather building blocks than implementations of specific algorithms. In this paper, we 
describe primitives for building and applying decision tree classifi ... 

Keywords: SQL-aware data mining, data mining primitives, query operators 



10 Querying web metadata: Native score management and text support in databases 
Gültekin Özsoyoğlu, İsmail Sengör Altıngövde, Abdullah Al-Hamdani, Selma Ayşe Özel, 
Özgür Ulusoy, Zehra Meral Özsoyoğlu 

December 2004 ACM Transactions on Database Systems (TODS), volume 29 issue 4 
Publisher: ACM Press 

Full text available: pdf (737.76 KB) Additional Information: full citation, abstract, references, index terms 

In this article, we discuss the issues involved in adding a native score management 
system to object-relational databases, to be used in querying Web metadata (that 
describes the semantic content of Web resources). The Web metadata model is based on 
topics (representing entities), relationships among topics (called metalinks), and 
importance scores (sideway values) of topics and metalinks. We extend database relations 
with scoring functions and importance scores. We add to SQL score-manag ... 

Keywords: Score management for Web applications 
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Wenwei Xue, Qiong Luo, Lei Chen, Yunhao Liu 

June 2006 Proceedings of the 2006 ACM SIGMOD international conference on 
Management of data SIGMOD '06 

Publisher: ACM Press 

Full text available: pdf (389.31 KB) Additional Information: full citation, abstract, references, index terms 

Many sensor network applications, such as object tracking and disaster monitoring, 
require effective techniques for event detection. In this paper, we propose a novel event 
detection mechanism based on matching the contour maps of in-network sensory data 
distribution. Our key observation is that events in sensor networks can be abstracted into 
spatio-temporal patterns of sensory data and that pattern matching can be done 
efficiently through contour map matching. Therefore, we propose simple SQL ... 

Keywords: contour maps, event detection, patterns, sensor networks 
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June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: pdf (249.27 KB) Additional Information: full citation, abstract, references 

The integration of the .NET Common Language Runtime (CLR) inside the SQL Server 
DBMS enables database programmers to write business logic in the form of functions, 
stored procedures, triggers, data types, and aggregates using modern programming 
languages such as C#, Visual Basic, C++, COBOL, and J++. This paper presents three 
main aspects of this work. First, it describes the architecture of the integration of the CLR 
inside the SQL Server database process to provide a safe, scalable, secure, an ... 
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14 Data mining (DM): Embedded predictive modeling in a parallel relational database 
A. Dorneich, R. Natarajan, E. Pednault, F. Tipu 

April 2006 Proceedings of the 2006 ACM symposium on Applied computing SAC '06 
Publisher: ACM Press 

Full text available: pdf (224.69 KB) Additional Information: full citation, abstract, references, index terms 

A methodology for embedding predictive modeling algorithms in a commercial parallel 
database is described; specifically, the parallel editions of IBM DB2 Universal Database, 
although many aspects of the overall approach can be used with other commercial parallel 
databases. This parallelization approach was implemented in the Version 8.2 release of 
DB2 Intelligent Miner Modeling to support a new predictive modeling algorithm called 
Transform Regression. This database-embedded mining algorithm pro ... 

Keywords: data mining, embedded analytics, parallel databases, predictive modeling, 
transform regression 
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16 Online aggregation 
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conference on Management of data SIGMOD '97, volume 26 issue 2 
Publisher: ACM Press 

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Aggregation in traditional database systems is performed in batch mode: a query is 
submitted, the system processes a large volume of data over a long period of time, and, 
eventually, the final answer is returned. This archaic approach is frustrating to users and 
has been abandoned in most other areas of computing. In this paper we propose a new 
online aggregation interface that permits users to both observe the progress of their 
aggregation queries and control execution on ... 
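The contrast with batch-mode aggregation can be sketched as a running estimate that is refined as tuples arrive. This is only an illustration under stated assumptions, not the interface proposed in the paper: the generator `running_average` is hypothetical, it assumes tuples arrive in random order, and it uses a normal-approximation interval with `z = 1.96` (roughly 95%).

```python
import math

def running_average(values, z=1.96):
    """Yield (tuples_seen, current_mean, half_width) after every input value.

    Uses Welford's update for the mean and sum of squared deviations, so each
    step is O(1). half_width is z times the estimated standard error of the
    mean, and is infinite until at least two values have been seen.
    """
    n = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for x in values:
        n += 1
        delta = x - mean
        mean += delta / n
        m2 += delta * (x - mean)
        if n > 1:
            half_width = z * math.sqrt(m2 / (n - 1) / n)
        else:
            half_width = float("inf")
        yield n, mean, half_width

# A caller can stop consuming as soon as the interval is tight enough.
progress = list(running_average([1, 2, 3, 4]))
```

Because the estimate is exposed after every tuple, a user could abandon the query early once the interval is acceptable, which is the kind of control that batch execution does not offer.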
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Data mining on large data warehouses is becoming increasingly important. In support of 
this trend, we consider a spectrum of architectural alternatives for coupling mining with 
database systems. These alternatives include: loose-coupling through a SQL cursor 
interface; encapsulation of a mining algorithm in a stored procedure; caching the data to 
a file system on-the-fly and mining; tight-coupling using primarily user-defined functions; 
and SQL implementations for processing in the DBMS. We ... 
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November 2004 Proceedings of the 12th annual ACM international workshop on 
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Publisher: ACM Press 

Full text available: pdf (191.98 KB) Additional Information: full citation, abstract, references, index terms 

Modern database applications including computer-aided design, multimedia information 
systems, medical imaging, molecular biology, or geographical information systems impose 
new requirements on the effective and efficient management of spatial data. Particular 
problems arise from the need of high resolutions for large spatial objects and from the 
design goal to use general purpose database management systems in order to guarantee 
industrial-strength. In the past two decades, various stand-alon ... 

Keywords: data management, object decomposition, object-relational database, spatial 
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November 1999 Proceedings of the eighth international conference on Information 
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Publisher: ACM Press 
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Large scale data analysis and mining activities require sophisticated information extraction 
queries. Many queries require complex aggregation, and many of these aggregates are 
non-distributive. Conventional solutions to this problem involve defining User Defined 
Aggregate Functions (UDAFs). However, the use of UDAFs entails several problems. 
Defining a new UDAF can be a significant burden for the user, and optimizing queries 
involving UDAFs is difficult because of the "black box" ... 
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June 2006 Proceedings of the 2006 ACM SIGMOD international conference on 
Management of data SIGMOD '06 

Publisher: ACM Press 

Full text available: pdf (278.59 KB) Additional Information: full citation, abstract, references, index terms 

In many document collections, documents are related to objects such as document 
authors, products described in the document, or persons referred to in the document. In 
many applications, the goal is to find these objects that best match a set of keywords. 
However, the keywords may not necessarily occur in the target objects; they occur only in 
the documents. For example, in a product review database, a user might search for 
names of products (say, laptops) using keywords like "lightweight" and "... 

Keywords: aggregation, early termination, keyword search, named entities, ranking, 
relationships, top-k queries 
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